Fashion Images Gender Age Vit Large Patch16 224 In21k V3
Apache-2.0
This model is a vision Transformer model fine-tuned on a fashion image gender and age classification dataset based on Google's ViT-Large architecture, achieving 99.6% accuracy on the evaluation set.
Image Classification
Transformers